Abstract

In this paper we study the distribution of stacks in $k$-noncrossing,
$r$-canonical RNA pseudoknot structures ($\langle k,r\rangle$-structures). An
RNA structure is called $k$-noncrossing if it has no more than $k-1$ mutually
crossing arcs and $r$-canonical if each arc is contained in a stack of length
at least $r$. Based on the ordinary generating function of
$\langle k,r\rangle$-structures [10] we derive the bivariate generating
function $T_{k,r}(x,u) = \sum_{n\ge 0}\sum_{0\le t\le n/2} T_{k,r}(n,t)\,u^t x^n$,
where $T_{k,r}(n,t)$ is the number of $\langle k,r\rangle$-structures having
exactly $t$ stacks, and study its singularities. We show that for a certain
parametrization of the variable $u$, $T_{k,r}(x,u)$ has a unique, dominant
singularity. The particular shift of this singularity parametrized by $u$
implies a central limit theorem for the distribution of stack-numbers. Our
results are of importance for understanding the "language" of minimum free
energy RNA pseudoknot structures generated by computer folding algorithms.

Key words: $k$-noncrossing RNA structure, pseudoknot, generating function,
singularity analysis, central limit theorem



1 Introduction 



In this paper we compute the bivariate generating function

$$T_{k,r}(x,u) = \sum_{n\ge 0}\ \sum_{0\le t\le n/2} T_{k,r}(n,t)\,u^t x^n, \qquad (1)$$

where $T_{k,r}(n,t)$ is the number of $k$-noncrossing, $r$-canonical
structures having exactly $t$ stacks. Furthermore we prove the central limit
theorem

$$\lim_{n\to\infty} P\left(\frac{X^n_{k,r}-\mu_{k,r}\,n}{\sqrt{n\,\sigma^2_{k,r}}} < x\right) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{x} e^{-t^2/2}\,dt, \qquad (2)$$

where $X^n_{k,r}$ is a random variable with distribution

$$P(X^n_{k,r} = t) = T_{k,r}(n,t)/T_{k,r}(n), \qquad (3)$$

see Fig. 1 and Tab. 1. Our results allow us to compute the distribution of
stack-numbers of a random $k$-noncrossing, $r$-canonical RNA structure. Since
the enumeration of the latter is obtained via the reflection principle, it is
nontrivial and currently not known how to construct such structures with
uniform probability in polynomial time.

An RNA molecule is a sequence of the four nucleotides A, G, U and C together
with the Watson-Crick (A-U, G-C) and (U-G) base pairing rules. RNA molecules
form "helical" structures by pairing nucleotides and thereby lowering their
minimum free energy (mfe). The biochemistry of these nucleotide-pairings
favors parallel stacking of bonds due to mesomeric effects. The resulting
3-dimensional configuration of the nucleotides is the RNA tertiary structure,
which determines the functionality of the molecule.

In this paper we study the distribution of stacks in $k$-noncrossing,
$r$-canonical RNA structures. We represent RNA structures as diagrams and
identify Watson-Crick (A-U, G-C) and (U-G) base pairings with arcs drawn in
the upper half-plane, ignoring the bonds of the primary sequence. A diagram is
a graph over the vertex set $[n] = \{1,\dots,n\}$ in which each vertex has
degree at most one. It is represented by drawing the vertices in a horizontal
line and its arcs $(i,j)$, where $i < j$, in the upper half-plane, see Fig. 2.
The vertices and arcs correspond to nucleotides and Watson-Crick (A-U, G-C)
and (U-G) base pairs, respectively. Diagrams have the three key parameters
$k$, $\lambda$ and $r$. Here $k-1$ is the maximum number of mutually crossing
arcs, $\lambda$ is the minimum arc-length and $r$ the minimum length of a
stack. A $\lambda$-arc is an arc $(i,j)$ where $j-i=\lambda$, and a stack of
length $r$ is a sequence of "parallel" arcs:
$((i,j),(i+1,j-1),\dots,(i+(r-1),j-(r-1)))$.
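The three diagram parameters can be made concrete with a small sketch. It assumes a diagram is given as a list of arcs $(i,j)$ with $i<j$, and that stacks are counted as maximal runs of parallel arcs (our reading of the definition). The function names `stacks`, `max_mutually_crossing` and `is_kr_structure` are hypothetical helpers, not notation from the paper, and the crossing test is brute force, meant for small examples only.

```python
from itertools import combinations

def stacks(arcs):
    """Split a set of arcs into maximal stacks ((i,j),(i+1,j-1),...)."""
    arc_set = set(arcs)
    result = []
    for (i, j) in sorted(arc_set):
        if (i - 1, j + 1) in arc_set:
            continue  # (i, j) is not the outermost arc of its stack
        stack = [(i, j)]
        while (stack[-1][0] + 1, stack[-1][1] - 1) in arc_set:
            stack.append((stack[-1][0] + 1, stack[-1][1] - 1))
        result.append(stack)
    return result

def crosses(a, b):
    """Arcs (i,j) and (p,q) cross iff i < p < j < q after ordering by left end."""
    (i, j), (p, q) = sorted((a, b))
    return i < p < j < q

def max_mutually_crossing(arcs):
    """Largest number of pairwise crossing arcs (brute force, small inputs only)."""
    arcs = list(arcs)
    best = min(1, len(arcs))
    for size in range(2, len(arcs) + 1):
        found = any(all(crosses(a, b) for a, b in combinations(sub, 2))
                    for sub in combinations(arcs, size))
        if not found:
            break  # subsets of mutually crossing sets are mutually crossing
        best = size
    return best

def is_kr_structure(arcs, k, r, min_arc=4):
    """Check degree <= 1, arc-length >= min_arc, k-noncrossing, all stacks >= r."""
    ends = [v for arc in arcs for v in arc]
    if len(ends) != len(set(ends)):
        return False
    if any(j - i < min_arc for (i, j) in arcs):
        return False
    if max_mutually_crossing(arcs) > k - 1:
        return False
    return all(len(s) >= r for s in stacks(arcs))

# Illustrative diagram on 16 vertices: two crossing stacks of length 3.
example = [(1, 12), (2, 11), (3, 10), (5, 16), (6, 15), (7, 14)]
print([len(s) for s in stacks(example)], max_mutually_crossing(example))
```

In this example the two stacks cross each other but no three arcs cross pairwise, so the diagram is 3-noncrossing and 3-canonical, but not 2-noncrossing.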

We call a $k$-noncrossing diagram with arc-length $\lambda \ge 4$ and
stack-length $r \ge 3$ a $k$-noncrossing, $r$-canonical RNA structure
($\langle k,r\rangle$-structure). We denote the number of
$\langle k,r\rangle$-structures and those with exactly $t$ stacks by
$T_{k,r}(n)$ and $T_{k,r}(n,t)$, respectively. For $k = 2$ this recovers the
well known RNA secondary structures [13,14,15]. Of course, the interesting
cases are $k \ge 3$, where we allow for crossings, i.e. the RNA pseudoknot
structures. However, to the best of our knowledge, our results are new even
for RNA secondary structures. In Fig. 3 we give two representations of a
typical $\langle 3,3\rangle$ RNA pseudoknot structure.






The results of this paper are relevant for the understanding of mfe RNA
pseudoknot structures. Computer folding algorithms, for instance the dynamic
programming routine of Rivas and Eddy [IT], can generate, at least for
n < 120, mfe-pseudoknot structures reasonably fast. While it is well known
how to generate a random RNA secondary structure, the particular construction
hinges on the fact that the latter can be built inductively. Although
$\langle k,r\rangle$-structures are D-finite, no inductive recursion exists
[7,8] and it is at present not known how to generate them with uniform
probability. Our findings are therefore of particular importance for
understanding the "language of RNA" generated by ab initio folding
algorithms.



2 Some basic facts 



In the following, we shall identify RNA pseudoknot structures with
$k$-noncrossing diagrams with minimum arc-length $\ge 4$ and stack-length
$r \ge 3$. From now on we will always assume that any structure has a minimum
arc-length of at least four. Let $\mathcal{T}_{k,r}(n)$ ($T_{k,r}(n)$) denote
the set (number) of $\langle k,r\rangle$-structures over $[n]$. Furthermore,
let $\mathcal{T}_{k,r}(n,t)$ and $T_{k,r}(n,t)$ denote the set and number of
$\langle k,r\rangle$-structures having exactly $t$ stacks. In addition, let
$\mathcal{T}_{k,r}(n,h)$ and $T_{k,r}(n,h)$ denote the set and number of
$\langle k,r\rangle$-structures having exactly $h$ arcs. The generating
function $T_{k,r}(x) = \sum_{n\ge 0} T_{k,r}(n)x^n$, where $k \ge 2$,
$r \ge 3$, of $\langle k,r\rangle$-structures has been obtained in [10]. The
function is closely related to $F_k(x) = \sum_n f_k(2n)x^{2n}$, the generating
function of $k$-noncrossing matchings. Beyond functional equations implied
directly by the reflection principle, the following asymptotic formula [H]
will be of importance:

$$\forall\, k\in\mathbb{N};\quad f_k(2n) \sim c_k\, n^{-((k-1)^2+(k-1)/2)}\,(2(k-1))^{2n}, \qquad c_k > 0. \qquad (4)$$

$T_{k,r}(x)$ is given as follows:

Theorem 2.1 Let $k \ge 2$, $r \ge 3$, $x$ be an indeterminate and $\rho_k$ the
dominant, positive real singularity of $F_k(z)$. Then $T_{k,r}(x)$ is given by

$$T_{k,r}(x) = \frac{1}{v_0(x)}\, F_k\!\left(\frac{\sqrt{w_0(x)}\,x}{v_0(x)}\right), \qquad (5)$$

where $w_0(x) = \frac{x^{2(r-1)}}{1-x^2+x^{2r}}$ and
$v_0(x) = 1 - x + w_0(x)x^2 + w_0(x)x^3 + w_0(x)x^4$. Furthermore,

$$T_{k,r}(n) \sim c_k\, n^{-((k-1)^2+(k-1)/2)} \left(\frac{1}{\gamma_{k,r}}\right)^{n}, \qquad \text{for } k = 2,3,\dots,9 \qquad (6)$$

holds, where $\gamma_{k,r}$ is the minimal positive real solution of the
equation $\frac{\sqrt{w_0(x)}\,x}{v_0(x)} = \rho_k$.
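The defining equation for $\gamma_{k,r}$ can be solved numerically. The following is a minimal sketch, not the authors' procedure: it assumes $\rho_k = 1/(2(k-1))$, which we read off from the exponential growth rate $(2(k-1))^{2n}$ in eq. (4), and it locates the smallest positive root by a coarse scan of $(0,1)$ followed by bisection. The helper names `psi` and `gamma` and the scan step are illustrative choices.

```python
import math

def psi(x, r):
    """sqrt(w0(x)) * x / v0(x) with w0, v0 as in Theorem 2.1."""
    w0 = x ** (2 * (r - 1)) / (1.0 - x ** 2 + x ** (2 * r))
    v0 = 1.0 - x + w0 * (x ** 2 + x ** 3 + x ** 4)
    return math.sqrt(w0) * x / v0

def gamma(k, r, step=1e-3, iterations=100):
    """Smallest positive solution of psi(x, r) = rho_k via scan plus bisection."""
    rho = 1.0 / (2 * (k - 1))  # assumption: radius of convergence implied by eq. (4)
    lo = 0.0
    # Scan upward until psi first reaches rho, which brackets the minimal root.
    while lo < 1.0 and psi(lo + step, r) < rho:
        lo += step
    if lo >= 1.0:
        raise ValueError("no root bracketed in (0, 1)")
    hi = lo + step
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if psi(mid, r) < rho:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

g = gamma(2, 3)
print(g, 1.0 / g)  # root and the corresponding exponential growth rate
```

By eq. (6), the reciprocal $1/\gamma_{k,r}$ is the exponential growth rate of $T_{k,r}(n)$.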

In addition we shall make use of the following basic theorem on limit distri- 
butions: 

Theorem 2.2 (Lévy-Cramér) Let $\{\xi_n\}$ be a sequence of random variables
and let $\{\varphi_n(x)\}$ and $\{F_n(x)\}$ be the corresponding sequences of
characteristic and distribution functions. If there exists a function
$\varphi(t)$ such that $\lim_{n\to\infty}\varphi_n(t) = \varphi(t)$ uniformly
over an arbitrary finite interval enclosing the origin, then there exists a
random variable $\xi$ with distribution function $F(x)$ such that

$$F_n(x) \longrightarrow F(x)$$

uniformly over any finite or infinite interval of continuity of $F(x)$.



3 Combinatorics of stacks 



The objective of this section is to compute the bivariate generating function

$$T_{k,r}(x,u) = \sum_{n\ge 0}\ \sum_{0\le t\le n/2} T_{k,r}(n,t)\,u^t x^n. \qquad (7)$$

For this purpose let us recall the concept of $k$-noncrossing cores developed
in [8]. A $k$-noncrossing core is a $k$-noncrossing diagram in which all
stacks have size one. We denote the set and the number of cores over $[n]$ by
$\mathcal{C}_k(n)$ and $C_k(n)$, respectively. Furthermore, let
$\mathcal{C}_k(n,t)$ and $C_k(n,t)$ denote the set and number of cores having
exactly $t$ stacks. We consider the arc-sets

$$\beta_2 = \{(i,i+2) \mid i+1 \text{ is isolated}\} \quad\text{and}\quad \beta_3 = \{(i,i+3) \mid i+1,\, i+2 \text{ are isolated}\}$$

and set $\beta = \beta_2 \cup \beta_3$. Furthermore, let

$$\mathcal{C}^*_k(n,t) = \{\zeta \mid \zeta \in \mathcal{C}_k(n,t);\ \zeta \text{ contains no } \beta\text{-arc}\} \qquad (8)$$
$$\mathcal{T}^*_{k,1}(n,h) = \{\zeta \mid \zeta \in \mathcal{T}_{k,1}(n,h);\ \zeta \text{ contains no } \beta\text{-arc}\} \qquad (9)$$

where $\mathcal{T}_{k,1}(n,h)$ denotes the set of $k$-noncrossing diagrams
without 1-arcs having exactly $h$ arcs. We set
$C^*_k(n,0) = T^*_{k,1}(n,0) = 1$ for $0 \le n$. In our first theorem we shall
show that the number of all $\langle k,r\rangle$-structures with exactly $t$
stacks is a sum of the numbers of $\mathcal{C}^*_k(n',t)$-diagrams with
positive integer coefficients. Let $f_k(m,\ell)$ denote the number of
$k$-noncrossing partial matchings over $[m]$ having exactly $\ell$ isolated
points.
Theorem 3.1 Suppose we have $k, t, r \in \mathbb{N}$, $k \ge 2$, $r \ge 3$ and
$1 \le t \le \lfloor n/2\rfloor$. Then

$$T_{k,r}(n,t) = \sum_{h=tr}^{\lfloor n/2\rfloor} \binom{(1-r)t+h-1}{t-1}\, C^*_k(n+2t-2h,\,t), \qquad (10)$$

$$C^*_k(n,h) = \sum_{b=0}^{h-1} (-1)^{h-b-1}\binom{h-1}{b}\, T^*_{k,1}(n-2h+2b+2,\,b+1) \qquad (11)$$

for $h \ge 1$. Furthermore, $T^*_{k,1}(n,h)$ satisfies

$$T^*_{k,1}(n,h) = \sum_{0\le j_1+j_2+j_3\le h} (-1)^{j_1+j_2+j_3}\,\lambda(n,j_1,j_2,j_3)\; f_k(n-2j_1-3j_2-4j_3,\ n-2h-j_2-2j_3), \qquad (12)$$

$$\lambda(n,j_1,j_2,j_3) = \binom{n-j_1-2j_2-3j_3}{j_1,\ j_2,\ j_3,\ n-2j_1-3j_2-4j_3}.$$
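Eq. (12) can be checked by brute force in the simplest case $k=2$. The sketch below assumes the elementary count $f_2(m,\ell) = \binom{m}{\ell}\,\mathrm{Cat}((m-\ell)/2)$ for noncrossing partial matchings (choose the isolated points, then a noncrossing perfect matching on the rest); this closed form is our assumption for $k=2$ only and is not taken from the text. All function names are illustrative.

```python
from itertools import combinations
from math import comb, factorial

def catalan(m):
    return comb(2 * m, m) // (m + 1)

def f2(m, iso):
    """Noncrossing (k = 2) partial matchings on m vertices with iso isolated points."""
    if m < 0 or iso < 0 or iso > m or (m - iso) % 2:
        return 0
    return comb(m, iso) * catalan((m - iso) // 2)

def lam(n, j1, j2, j3):
    """Multinomial coefficient lambda(n, j1, j2, j3) of Theorem 3.1."""
    rest = n - 2 * j1 - 3 * j2 - 4 * j3
    if rest < 0:
        return 0
    return factorial(j1 + j2 + j3 + rest) // (
        factorial(j1) * factorial(j2) * factorial(j3) * factorial(rest))

def t_star_formula(n, h):
    """Right-hand side of eq. (12), specialised to k = 2."""
    total = 0
    for j1 in range(h + 1):
        for j2 in range(h + 1 - j1):
            for j3 in range(h + 1 - j1 - j2):
                sign = (-1) ** (j1 + j2 + j3)
                total += sign * lam(n, j1, j2, j3) * f2(
                    n - 2 * j1 - 3 * j2 - 4 * j3, n - 2 * h - j2 - 2 * j3)
    return total

def matchings(vertices, h):
    """Yield every set of h disjoint arcs on the sorted vertex list."""
    if h == 0:
        yield []
        return
    if len(vertices) < 2 * h:
        return
    first, rest = vertices[0], vertices[1:]
    yield from matchings(rest, h)  # first vertex stays isolated
    for idx, other in enumerate(rest):
        for tail in matchings(rest[:idx] + rest[idx + 1:], h - 1):
            yield [(first, other)] + tail

def t_star_brute(n, h):
    """Count noncrossing diagrams on [n] with h arcs and no 1-, beta_2-, beta_3-arcs."""
    count = 0
    for arcs in matchings(list(range(1, n + 1)), h):
        paired = {v for arc in arcs for v in arc}
        if any(a < c < b < d or c < a < d < b
               for (a, b), (c, d) in combinations(arcs, 2)):
            continue  # contains a crossing, hence not 2-noncrossing
        bad = any(
            j - i == 1
            or (j - i == 2 and i + 1 not in paired)
            or (j - i == 3 and i + 1 not in paired and i + 2 not in paired)
            for (i, j) in arcs)
        count += not bad
    return count

print(t_star_formula(6, 1), t_star_brute(6, 1))
```

For $n=6$, $h=1$ both counts correspond to the three admissible arcs $(1,5)$, $(2,6)$ and $(1,6)$.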

Proof. First, there exists a mapping from $\langle k,r\rangle$-structures with
$t$ stacks into $\mathcal{C}^*_k(n-2(h-t),t)$:

$$c\colon \mathcal{T}_{k,r}(n,t) \to \bigcup_{tr\le h\le \lfloor n/2\rfloor} \mathcal{C}^*_k(n-2(h-t),\,t), \qquad \zeta \mapsto c(\zeta). \qquad (13)$$

The core, $c(\zeta)$, is obtained in two steps: first we map arcs and isolated
vertices as follows

$$\forall\, \ell \ge r-1;\quad ((i-\ell,j+\ell),\dots,(i,j)) \mapsto (i,j) \quad\text{and}\quad j \mapsto j \ \text{ if } j \text{ is isolated}. \qquad (14)$$

Second we relabel the vertices of the resulting diagram from left to right in
increasing order. That is, we replace each stack by a single arc, keep
isolated vertices and then relabel, see Fig. 4. It is straightforward to
verify that
$c\colon \mathcal{T}_{k,r}(n,t) \to \bigcup_{tr\le h\le \lfloor n/2\rfloor} \mathcal{C}^*_k(n-2(h-t),t)$
is well-defined and surjective. Taking into account that each stack has its
specific length leads us to consider

$$f_{k,r}\colon \mathcal{T}_{k,r}(n,t) \to \bigcup_{tr\le h\le \lfloor n/2\rfloor} \Big( \mathcal{C}^*_k(n+2t-2h,\,t) \times \Big\{ (a_i)_{1\le i\le t} \ \Big|\ \sum_{i=1}^{t} a_i = h-t,\ a_i \ge r-1 \Big\} \Big) \qquad (15)$$

given by $f_{k,r}(\zeta) = (c(\zeta), (a_i)_{1\le i\le t})$. By construction
$f_{k,r}$ is well-defined and a bijection: obviously we can reconstruct the
original $\mathcal{T}_{k,r}(n,t)$-element from its core and the sequence of
its stack-multiplicities (labeling the stacks from left to right). Computing
the multiplicities of the resulting cores we derive

$$\Big| \Big\{ (a_i)_{1\le i\le t} \ \Big|\ \sum_{i=1}^{t} a_i = h-t;\ a_i \ge r-1 \Big\} \Big| = \binom{(1-r)t+h-1}{t-1}. \qquad (16)$$
We can therefore conclude

$$T_{k,r}(n,t) = \sum_{h=tr}^{\lfloor n/2\rfloor} \binom{(1-r)t+h-1}{t-1}\, C^*_k(n+2t-2h,\,t), \qquad (17)$$

whence eq. (10). Next we switch and consider arcs instead of stacks, using the
parameter $h$. Again, collapsing the stacks induces the mapping

$$c^*\colon \mathcal{T}^*_{k,1}(n,h) \to \bigcup_{0\le b\le h-1} \mathcal{C}^*_k(n-2b,\,h-b), \qquad \zeta \mapsto c^*(\zeta). \qquad (18)$$

Indeed, $c^*$ is well defined, since a $\mathcal{T}^*_{k,1}(n,h)$-diagram can
be mapped into a core structure without 1- and $\beta$-arcs, i.e. into an
element of $\mathcal{C}^*_k(n',h')$. It is straightforward to show that $c^*$
is surjective. Labeling the $h-b$ stacks of
$\zeta \in \mathcal{T}^*_{k,1}(n,h)$ from left to right and keeping track of
multiplicities gives rise to the map

$$m_k\colon \mathcal{T}^*_{k,1}(n,h) \to \bigcup_{0\le b\le h-1} \Big[ \mathcal{C}^*_k(n-2b,\,h-b) \times \Big\{ (a_i)_{1\le i\le h-b} \ \Big|\ \sum_{i=1}^{h-b} a_i = b,\ a_i \ge 0 \Big\} \Big] \qquad (19)$$

given by $m_k(\zeta) = (c^*(\zeta), (a_i)_{1\le i\le h-b})$. The mapping $m_k$
is well-defined and a bijection. Clearly,

$$\Big| \Big\{ (a_i)_{1\le i\le h-b} \ \Big|\ \sum_{i=1}^{h-b} a_i = b;\ a_i \ge 0 \Big\} \Big| = \binom{h-1}{b} \qquad (20)$$

and eq. (19) and eq. (20) imply

$$T^*_{k,1}(n,h) = \sum_{b=0}^{h-1} \binom{h-1}{b}\, C^*_k(n-2b,\,h-b). \qquad (21)$$

Via Möbius-inversion eq. (11) follows. It is straightforward to show that
there are

$$\lambda(n,j_1,j_2,j_3) = \binom{n-j_1-2j_2-3j_3}{j_1,\ j_2,\ j_3,\ n-2j_1-3j_2-4j_3}$$

ways to select $j_1$ 1-arcs, $j_2$ $\beta_2$-arcs and $j_3$ $\beta_3$-arcs
over $[n]$. Since removing $j_1$ 1-arcs, $j_2$ $\beta_2$-arcs and $j_3$
$\beta_3$-arcs removes $2j_1+3j_2+4j_3$ vertices, the number of configurations
of at least $j_1$ 1-arcs, $j_2$ $\beta_2$-arcs and $j_3$ $\beta_3$-arcs is
given by
$\lambda(n,j_1,j_2,j_3)\, f_k(n-2j_1-3j_2-4j_3,\ n-2h-j_2-2j_3)$. Via the
inclusion-exclusion principle, we arrive at

$$T^*_{k,1}(n,h) = \sum_{0\le j_1+j_2+j_3\le h} (-1)^{j_1+j_2+j_3}\,\lambda(n,j_1,j_2,j_3)\; f_k(n-2j_1-3j_2-4j_3,\ n-2h-j_2-2j_3), \qquad (22)$$

whence Theorem 3.1. □
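The two ingredients of the first half of the proof, the core map $c$ and the multiplicity count of eq. (16), can be illustrated as follows. This is a sketch under our own conventions: a diagram is a list of arcs over $[n]$, each stack is contracted to its outermost arc (which arc of the stack is kept does not matter after relabeling), and stack lengths $l_i = a_i + 1$ are enumerated instead of the $a_i$. The names `core` and `stack_length_choices` are hypothetical.

```python
from itertools import product
from math import comb

def core(n, arcs):
    """Contract every maximal stack to one arc and relabel the surviving vertices."""
    arc_set = set(arcs)
    outer = [(i, j) for (i, j) in arc_set if (i - 1, j + 1) not in arc_set]
    # Endpoints of all non-outermost arcs disappear under the contraction.
    inner_ends = {v for (i, j) in arc_set if (i - 1, j + 1) in arc_set
                  for v in (i, j)}
    kept = [v for v in range(1, n + 1) if v not in inner_ends]
    relabel = {v: idx + 1 for idx, v in enumerate(kept)}
    return len(kept), sorted((relabel[i], relabel[j]) for (i, j) in outer)

def stack_length_choices(t, h, r):
    """Brute-force count of stack lengths (l_1..l_t) with l_i >= r and sum h."""
    return sum(1 for ls in product(range(r, h + 1), repeat=t) if sum(ls) == h)

# Two stacks of length 3 over [16]: h = 6 arcs, t = 2 stacks.
n, arcs = 16, [(1, 12), (2, 11), (3, 10), (5, 16), (6, 15), (7, 14)]
print(core(n, arcs))                       # core has n - 2(h - t) = 8 vertices
print(stack_length_choices(2, 8, 3), comb((1 - 3) * 2 + 8 - 1, 2 - 1))
```

The second printed pair compares the enumeration with the binomial coefficient of eq. (16) for $t=2$, $h=8$, $r=3$.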

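Of course, Theorem 3.1 implies a functional relation for $T_{k,r}(x,u)$:

Lemma 3.2 Let $k \ge 2$, $r \ge 3$ and let $u$, $x$ be indeterminates. Then we
have the functional relation

$$\sum_{n\ge 0}\ \sum_{0\le t\le \lfloor n/2\rfloor} T_{k,r}(n,t)\,u^t x^n = \sum_{n\ge 0}\ \sum_{0\le t\le \lfloor n/2\rfloor} C^*_k(n,t) \left(\frac{u\,x^{2(r-1)}}{1-x^2}\right)^{t} x^n \qquad (23)$$

and in particular, for $u = 1$

$$\sum_{n\ge 0} T_{k,r}(n)\,x^n = \sum_{n\ge 0}\ \sum_{0\le t\le \lfloor n/2\rfloor} C^*_k(n,t) \left(\frac{x^{2(r-1)}}{1-x^2}\right)^{t} x^n. \qquad (24)$$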



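Proof. We set

$$\varphi_t(x) = \sum_{m\ge 0} C^*_k(m,t)\,x^m, \qquad t \ge 1,$$

and proceed by deducing the functional equation for $T_{k,r}(x,u)$ via
Theorem 3.1. For this purpose we note that for $t = 0$ the binomial
coefficient $\binom{(1-r)t+h-1}{t-1}$ is zero, while the term
$T_{k,r}(n,0) = 1$ for $n \ge 1$. Clearly, $T_{k,r}(n,0) = 1$ counts for each
$n \ge 1$ the structure consisting only of isolated vertices. We accordingly
have to extend the identity of Theorem 3.1

$$T_{k,r}(n,t) = \sum_{h=tr}^{\lfloor n/2\rfloor} \binom{(1-r)t+h-1}{t-1}\, C^*_k(n+2t-2h,\,t)$$

to the case $t = 0$, which, together with the empty structure for $n = 0$,
gives rise to the term $\sum_{n\ge 0} x^n = \frac{1}{1-x}$. Accordingly, we
derive

$$\sum_{n\ge 0}\ \sum_{0\le t\le \lfloor n/2\rfloor} T_{k,r}(n,t)\,u^t x^n = \sum_{n\ge 2}\ \sum_{1\le t\le \lfloor n/2\rfloor}\ \sum_{h=tr}^{\lfloor n/2\rfloor} C^*_k(n+2t-2h,\,t) \binom{(1-r)t+h-1}{t-1} u^t x^n + \sum_{n\ge 0} x^n. \qquad (25)$$

We rewrite the right hand side of eq. (25) as

$$\sum_{t\ge 1}\ \sum_{h\ge tr}\ \sum_{n\ge 2h} C^*_k(n+2t-2h,\,t)\,x^{n+2t-2h} \binom{(1-r)t+h-1}{t-1} u^t x^{2h-2t} + \frac{1}{1-x}$$
$$= \sum_{t\ge 1}\ \sum_{h\ge tr} \varphi_t(x) \binom{(1-r)t+h-1}{t-1} u^t x^{2h-2t} + \frac{1}{1-x}.$$

Rearranging the terms of the summation we obtain

$$\sum_{t\ge 1} \varphi_t(x)\,u^t \sum_{h\ge tr} \binom{(1-r)t+h-1}{t-1} x^{2(h-t)} + \frac{1}{1-x}.$$

Using $\sum_{n\ge 0} \binom{n+k}{n} x^n = \frac{1}{(1-x)^{k+1}}$, we can
transform the summation over $h$ and derive

$$\sum_{n\ge 0}\ \sum_{0\le t\le \lfloor n/2\rfloor} T_{k,r}(n,t)\,u^t x^n = \sum_{t\ge 1} \varphi_t(x) \left(\frac{u\,x^{2(r-1)}}{1-x^2}\right)^{t} + \frac{1}{1-x} = \sum_{n\ge 0}\ \sum_{0\le t\le \lfloor n/2\rfloor} C^*_k(n,t) \left(\frac{u\,x^{2(r-1)}}{1-x^2}\right)^{t} x^n$$

and the proof of Lemma 3.2 is complete. □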
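The summation over $h$ used in the proof amounts to the power-series identity $\sum_{h\ge tr}\binom{(1-r)t+h-1}{t-1}x^{2(h-t)} = \big(x^{2(r-1)}/(1-x^2)\big)^t$. The sketch below compares both sides coefficient by coefficient up to a truncation degree $N$; the helper names and the choice of $N$ are ours.

```python
from math import comb

def series_mul(a, b, N):
    """Product of two power series (coefficient lists), truncated at x^N."""
    out = [0] * (N + 1)
    for i, ai in enumerate(a):
        if ai:
            for j, bj in enumerate(b[: N + 1 - i]):
                out[i + j] += ai * bj
    return out

def closed_form(r, t, N):
    """Coefficients of (x^(2(r-1)) / (1 - x^2))^t up to x^N."""
    base = [0] * (N + 1)
    for e in range(2 * (r - 1), N + 1, 2):
        base[e] = 1  # x^(2(r-1)) / (1 - x^2) = x^(2(r-1)) + x^(2r) + ...
    out = [1] + [0] * N
    for _ in range(t):
        out = series_mul(out, base, N)
    return out

def binomial_sum(r, t, N):
    """Coefficients of sum_{h >= tr} binom((1-r)t + h - 1, t - 1) x^(2(h-t)) up to x^N."""
    out = [0] * (N + 1)
    h = t * r
    while 2 * (h - t) <= N:
        out[2 * (h - t)] = comb((1 - r) * t + h - 1, t - 1)
        h += 1
    return out

print(closed_form(3, 2, 16) == binomial_sum(3, 2, 16))
```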

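We next consider a functional equation for
$\sum_{n,h} T^*_{k,1}(n,h)\,u^h x^n$ proved in [8].

Lemma 3.3 Let $k, r \in \mathbb{N}$, $k \ge 2$ and let $u$, $x$ be
indeterminates. Suppose we have

$$\forall\, h \ge 1,\quad A_r(n,h) = \sum_{b=r-1}^{h-1} \binom{b+(2-r)(h-b)-1}{h-b-1}\, C_k(n-2b,\,h-b) \quad\text{and}\quad A_r(n,0) = 1. \qquad (26)$$

Then we have the functional relation

$$\sum_{n\ge 0}\ \sum_{0\le h\le n/2} A_r(n,h)\,u^h x^n = \sum_{n\ge 0}\ \sum_{0\le h\le n/2} C_k(n,h) \left(\frac{u\,(u x^2)^{r-1}}{1-u x^2}\right)^{h} x^n. \qquad (27)$$

Combining Lemma 3.3 and
$T^*_{k,1}(n,h) = \sum_{b=0}^{h-1} \binom{h-1}{b} C^*_k(n-2b,h-b)$ we arrive at

$$\sum_{n\ge 0}\ \sum_{0\le h\le n/2} T^*_{k,1}(n,h)\,u^h x^n = \sum_{n\ge 0}\ \sum_{0\le h\le n/2} C^*_k(n,h) \left(\frac{u}{1-u x^2}\right)^{h} x^n. \qquad (28)$$

We shall make use of an additional relation for
$\sum_{n\ge 0}\sum_{0\le h\le n/2} T^*_{k,1}(n,h)\,u^h x^n$, proved in [10]: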
Lemma 3.4 Let $k \in \mathbb{N}$, $k \ge 2$, and $x$, $w$ be indeterminates.
Then we have the functional relation

$$\sum_{n\ge 0}\ \sum_{0\le h\le n/2} T^*_{k,1}(n,h)\,w^h x^n = \frac{1}{v(x)} \sum_{n\ge 0} f_k(2n) \left(\frac{\sqrt{w}\,x}{v(x)}\right)^{2n}, \qquad (29)$$

where $v(x) = 1 - x + w x^2 + w x^3 + w x^4$ and $f_k(2n)$ is the number of
$k$-noncrossing matchings over $2n$ vertices.

We are now in the position to present the main result of this section: 

Theorem 3.5 Let $k \ge 2$, $r \ge 3$ and suppose $u$, $x$ are indeterminates.
Then we have the identity of formal power series

$$\sum_{n\ge 0}\ \sum_{0\le t\le n/2} T_{k,r}(n,t)\,u^t x^n = \frac{1}{v(x)} \sum_{n\ge 0} f_k(2n) \left(\frac{\sqrt{u_0}\,x}{v(x)}\right)^{2n}, \qquad (30)$$

where $v(x)$ and $u_0 = u_0(x,u)$ are given by

$$u_0 = \frac{u\,x^{2(r-1)}}{u x^{2r} - x^2 + 1} \qquad (31)$$
$$v(x) = 1 - x + u_0 x^2 + u_0 x^3 + u_0 x^4. \qquad (32)$$

In particular, we can consider eq. (30) as a relation between analytic
functions, valid for $u = e^s$ and $|s| < \epsilon$ for $\epsilon$
sufficiently small and $|x| < 1/2$.

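Proof. We interpret the bivariate generating function of cores,

$$\sum_{n\ge 0}\ \sum_{0\le h\le n/2} C^*_k(n,h)\,w^h x^n,$$

in two different ways: first, via Lemma 3.2 we have

$$\sum_{n\ge 0}\ \sum_{0\le t\le \lfloor n/2\rfloor} T_{k,r}(n,t)\,u^t x^n = \sum_{n\ge 0}\ \sum_{0\le t\le \lfloor n/2\rfloor} C^*_k(n,t) \left(\frac{u\,x^{2(r-1)}}{1-x^2}\right)^{t} x^n \qquad (33)$$

and second, according to eq. (28):

$$\sum_{n\ge 0}\ \sum_{0\le h\le n/2} T^*_{k,1}(n,h)\,u_0^h x^n = \sum_{n\ge 0}\ \sum_{0\le h\le n/2} C^*_k(n,h) \left(\frac{u_0}{1-u_0 x^2}\right)^{h} x^n. \qquad (34)$$

Note that, by definition of cores, we may replace the index "$h$" by "$t$" in
the right hand side of eq. (34). The key observation is the relation between
the terms $\frac{u\,x^{2(r-1)}}{1-x^2}$ and $\frac{u_0}{1-u_0 x^2}$. Using the
Ansatz

$$\frac{u_0}{1-u_0 x^2} = \frac{u\,x^{2(r-1)}}{1-x^2} \qquad (35)$$

we obtain the unique solution
$u_0 = \frac{u\,x^{2(r-1)}}{u x^{2r} - x^2 + 1}$. Accordingly, we conclude

$$\sum_{n\ge 0}\ \sum_{0\le t\le \lfloor n/2\rfloor} T_{k,r}(n,t)\,u^t x^n = \sum_{n\ge 0}\ \sum_{0\le t\le \lfloor n/2\rfloor} C^*_k(n,t) \left(\frac{u\,x^{2(r-1)}}{1-x^2}\right)^{t} x^n = \sum_{n\ge 0}\ \sum_{0\le h\le n/2} T^*_{k,1}(n,h)\,u_0^h x^n = \frac{1}{v(x)} \sum_{n\ge 0} f_k(2n) \left(\frac{\sqrt{u_0}\,x}{v(x)}\right)^{2n},$$

where $v(x) = 1 - x + u_0 x^2 + u_0 x^3 + u_0 x^4$ and
$u_0 = \frac{u\,x^{2(r-1)}}{u x^{2r} - x^2 + 1}$; the last equality is
Lemma 3.4 with $w = u_0$. In order to consider eq. (30) as a relation between
analytic functions we need to satisfy $u x^{2r} - x^2 + 1 \ne 0$. Suppose
$u = e^s$, $|s| < \epsilon$ for $\epsilon$ sufficiently small. From
$|x| < 1/2$ and the continuity (in $s$) of the roots of the family of
polynomials

$$\omega_s(X) = e^s X^{2r} - X^2 + 1, \qquad |s| < \epsilon,$$

we conclude $\omega_s(x) \ne 0$. Therefore eq. (30) holds for $u = e^s$,
$|s| < \epsilon$ and sufficiently small $\epsilon$ and $|x| < 1/2$. □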
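The Ansatz (35) and its solution (31) can be checked with exact rational arithmetic. The following sketch evaluates both sides of (35) at a few arbitrarily chosen rational points; the sample values of $u$ and $x$ are illustrative and carry no meaning beyond avoiding the poles.

```python
from fractions import Fraction

def u0(u, x, r):
    """Solution (31) of the Ansatz (35)."""
    return u * x ** (2 * (r - 1)) / (u * x ** (2 * r) - x ** 2 + 1)

def ansatz_sides(u, x, r):
    """Left and right hand side of eq. (35) for given u, x, r."""
    left = u0(u, x, r) / (1 - u0(u, x, r) * x ** 2)
    right = u * x ** (2 * (r - 1)) / (1 - x ** 2)
    return left, right

u, x = Fraction(3, 2), Fraction(1, 3)
for r in range(3, 8):
    left, right = ansatz_sides(u, x, r)
    print(r, left == right)
```

Since `Fraction` arithmetic is exact, equality here is an identity check at the sample points rather than a floating-point approximation.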



4 The central limit theorem 



Suppose $\epsilon > 0$, $2 \le k$ and $u$ is parametrized as $u = e^s$, where
$|s| < \epsilon$. We set

$$\varphi_{n,k,r}(s) = \sum_{t\le n/2} T_{k,r}(n,t)\,e^{ts} \qquad (36)$$
$$U_k(z,s) = \sum_{n\ge 0} \varphi_{n,k,r}(s)\,z^n. \qquad (37)$$

We will use Theorem 3.5, which relates the generating functions $U_k(z,s)$ and
$F_k(z) = \sum_n f_k(2n) z^{2n}$, in order to study the singularities of
$U_k(z,s)$.

Theorem 4.1 Suppose $\epsilon > 0$, $2 \le k \le 9$, $3 \le r \le 7$ and
$u = e^s$, where $|s| < \epsilon$. Then the following assertions hold:

(a) For sufficiently small $\epsilon$, $U_k(z,s)$ has the unique dominant
singularity $\gamma_{k,r}(s)$, which is analytic in $s$ and the unique minimal
real positive solution of

$$\frac{\sqrt{u_0(s)}\,z}{1 - z + u_0(s)z^2 + u_0(s)z^3 + u_0(s)z^4} - \rho_k = 0. \qquad (38)$$

(b) The coefficients of $U_k(z,s)$ are asymptotically given by

$$[z^n]\,U_k(z,s) = A(s)\,\big(1 - O(n^{-1})\big)\, n^{-((k-1)^2+(k-1)/2)} \left(\frac{1}{\gamma_{k,r}(s)}\right)^{n}, \qquad A(s) \in \mathbb{C}, \qquad (39)$$

uniformly in $s$ in a neighborhood of 0. In particular, the subexponential
factors of the coefficients of $U_k(z,s)$ coincide with those of $F_k(z)$ and
are independent of $s$.

Proof. According to Theorem 3.5 we have for $|s| < \epsilon$

$$U_k(z,s) = \frac{1}{v(z,s)} \sum_{n\ge 0} f_k(2n) \left(\frac{\sqrt{u_0(s)}\,z}{v(z,s)}\right)^{2n}, \qquad (40)$$

where $v(z,s) = 1 - z + u_0(s)z^2 + u_0(s)z^3 + u_0(s)z^4$ and
$u_0(s) = \frac{e^s z^{2(r-1)}}{e^s z^{2r} - z^2 + 1}$. We set

$$\psi_r(z,s) = \frac{\sqrt{u_0(s)}\,z}{1 - z + u_0(s)z^2 + u_0(s)z^3 + u_0(s)z^4} \qquad (41)$$

$$W_k(z,s) = \sum_{n\ge 0} f_k(2n) \left(\frac{\sqrt{u_0(s)}\,z}{v(z,s)}\right)^{2n}. \qquad (42)$$

Our first objective is to prove the existence of the unique singularity
$\gamma_{k,r}(s)$. For this purpose we consider

$$F(z,s) = \psi_r(z,s) - \rho_k. \qquad (43)$$

For $s = 0$ and $2 \le k \le 9$, $3 \le r \le 7$ it is straightforward to
verify that a unique minimal real solution, $\omega$, exists. We observe that
for $|s| < \epsilon$ the following holds: (i) $F(\omega,0) = 0$, (ii)
$F_z(\omega,0) \ne 0$ and (iii) the partial derivatives $F_z(z,s)$ and
$F_s(z,s)$ are continuous. According to the analytic implicit function theorem
[1], there exists in a sufficiently small neighborhood of 0 a unique analytic
function $\gamma_{k,r}(s)$ that satisfies

$$\forall\, s;\ |s| < \epsilon;\quad F(\gamma_{k,r}(s), s) = 0 \quad\text{and}\quad \gamma_{k,r}(0) = \omega.$$

Claim 1. For $\epsilon$ sufficiently small and $|s| < \epsilon$,
$\gamma_{k,r}(s)$ is a dominant singularity of $U_k(z,s)$.

Let $\xi(s)$ be a dominant singularity of $U_k(z,s)$. It is straightforward to
prove that $\xi(0)$ is necessarily a singularity of $W_k(z,0)$. We proceed by
applying a continuity argument. For $\epsilon$ sufficiently small and
$|s| < \epsilon$ the singularities of $v(z,s)^{-1}$ and $\gamma_{k,r}(s)$ are
both continuous in $s$. Therefore we can conclude from our observation for
$s = 0$ that, for sufficiently small $\epsilon$, all singularities of
$v(z,s)^{-1}$ have modulus strictly larger than $\gamma_{k,r}(s)$.
Accordingly, for sufficiently small $\epsilon$, $\gamma_{k,r}(s)$ is a
dominant singularity of $U_k(z,s)$, whence Claim 1.

Claim 2. $\gamma_{k,r}(s)$ is unique.

Functional relations arising from the reflection principle [5] imply that the
generating function $F_k(z)$ is D-finite [12]. Accordingly there exists some
$e \in \mathbb{N}$ for which $F_k(z)$ satisfies an ODE of the form

$$q_{0,k}(z)\,\frac{d^{e}}{dz^{e}} F_k(z) + q_{1,k}(z)\,\frac{d^{e-1}}{dz^{e-1}} F_k(z) + \cdots + q_{e,k}(z)\,F_k(z) = 0, \qquad (44)$$

where $q_{j,k}(z)$ are polynomials. Any dominant singularity of $F_k(z)$ is
contained in the set of roots of $q_{0,k}(z)$ [12] and via direct computation
we can verify that $\gamma_{k,r}(0)$ is the unique solution with minimal
modulus of all solutions of

$$\psi_r(z,0) = |\rho_k|. \qquad (45)$$

The analytic implicit function theorem applied to eq. (43) now implies locally
the existence of a unique analytic function $\gamma_{k,r}(s)$ solving
$\psi_r(z,s) = \rho_k$. Using continuity we conclude from the fact that
$\gamma_{k,r}(0)$ is the unique solution with minimal modulus of all solutions
of $\psi_r(z,0) = |\rho_k|$, that for $\epsilon$ sufficiently small,
$\gamma_{k,r}(s)$ is the unique solution with minimal modulus of all solutions
of $\psi_r(z,s) = |\rho_k|$. This implies Claim 2 and (a) follows.

We finally prove (b). According to eq. (4) we have

$$f_k(2n) \sim c_k\, n^{-((k-1)^2+(k-1)/2)}\,(2(k-1))^{2n}$$

for some $c_k > 0$ and

$$F_k(z) = \begin{cases} O\big((z-\rho_k)^{(k-1)^2+(k-1)/2-1}\,\ln(z-\rho_k)\big) & \text{for } k \text{ odd},\ z \to \rho_k \\ O\big((z-\rho_k)^{(k-1)^2+(k-1)/2-1}\big) & \text{for } k \text{ even},\ z \to \rho_k \end{cases}$$

in accordance with basic structure theorems for singularities of solutions of
eq. (44) [1], p. 499. Let $Q_{\gamma_{k,r}(s)}(z,s)$ denote the singular
expansion of $U_k(z,s)$ at $\gamma_{k,r}(s)$. We have shown in Claim 1 that
$\psi_r(z,s)$ does not induce any dominant singularities and is regular at
$\rho_k$. Let $Q_{\rho_k}(z)$ denote the singular expansion of $F_k(z)$ at the
dominant singularity $\rho_k$, i.e. $F_k(z) = O(Q_{\rho_k}(z))$ for
$z \to \rho_k$. The singular expansion of the compositum,
$F_k(\psi_r(z,s))$, i.e. $Q_{\gamma_{k,r}(s)}(z,s)$, is derived by
substituting the Taylor-expansion of $\psi_r(z,s)$ into $Q_{\rho_k}(z)$ and we
observe

$$Q_{\gamma_{k,r}(s)}(z,s) = Q_{\rho_k}(\psi_r(z,s)) = O\big(Q_{\gamma_{k,r}(s)}(z)\big). \qquad (46)$$

Indeed, eq. (46) follows immediately by substituting
$\psi_r(z,s) - \psi_r(\gamma_{k,r}(s),s)$ for $z - \rho_k$, which does not
change the singular expansion. Since $\psi_r(z,s)$ is continuous in $s$ the
singular expansion is uniform with respect to the parameter $s$ and the
uniformity lemma of singularity analysis [1] implies

$$[z^n]\,U_k(z,s) = A(s)\,\big(1 - O(1/n)\big)\, n^{-((k-1)^2+(k-1)/2)} \left(\frac{1}{\gamma_{k,r}(s)}\right)^{n} \qquad (47)$$

for some $A(s) \in \mathbb{C}$, uniformly in $s$ in a neighborhood of 0.
Therefore the asymptotic expansion is uniform in $s$ and eq. (39) follows. The
proof shows in addition that the subexponential factors of the coefficients of
$U_k(z,s)$ coincide with those of $F_k(z)$ and are independent of $s$. □

We now consider the random variable $X^n_{k,r}$ having the distribution

$$P(X^n_{k,r} = t) = T_{k,r}(n,t)/T_{k,r}(n), \qquad (48)$$

where $t = 0, 1, \dots, \lfloor n/2\rfloor$. We shall show in the following
theorem that the distribution of $X^n_{k,r}$ is determined by the shift of the
singularity parametrized by $s$. In other words, Theorem 4.1 induces the
central limit theorem below. The proof-idea of Theorem 4.2 is due to Bender,
where it appeared in a slightly different context pp. It follows from
analyzing the characteristic function, using the Lévy-Cramér Theorem of
Section 2.

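Theorem 4.2 For any $2 \le k \le 9$ and $3 \le r \le 7$ there exists a pair
$(\mu_{k,r}, \sigma_{k,r})$ such that the normalized random variable

$$Y^n_{k,r} = \frac{X^n_{k,r} - \mu_{k,r}\,n}{\sqrt{n\,\sigma^2_{k,r}}} \qquad (49)$$

has asymptotically normal distribution with parameter (0, 1). That is, we have

$$\lim_{n\to\infty} P\left(\frac{X^n_{k,r} - \mu_{k,r}\,n}{\sqrt{n\,\sigma^2_{k,r}}} < x\right) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{x} e^{-c^2/2}\,dc, \qquad (50)$$

where $\mu_{k,r}$ and $\sigma^2_{k,r}$ are given by

$$\mu_{k,r} = -\frac{\gamma'_{k,r}(0)}{\gamma_{k,r}(0)}, \qquad \sigma^2_{k,r} = \left(\frac{\gamma'_{k,r}(0)}{\gamma_{k,r}(0)}\right)^{2} - \frac{\gamma''_{k,r}(0)}{\gamma_{k,r}(0)}. \qquad (51)$$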

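Proof. Suppose we are given the random variable (r.v.) $\xi_n$ with mean
$\mu_n$ and variance $\sigma_n^2$. We consider the rescaled r.v.
$\eta_n = (\xi_n - \mu_n)\sigma_n^{-1}$ and the characteristic function of
$\eta_n$:

$$f_{\eta_n}(c) = E[e^{ic\eta_n}] = E[e^{ic\,\xi_n/\sigma_n}]\; e^{-i\frac{\mu_n}{\sigma_n}c}. \qquad (52)$$

Writing $X_n$ instead of $X^n_{k,r}$ we derive for $\xi_n = X_n$, substituting
for the term $E[e^{ic\,\xi_n/\sigma_n}]$,

$$f_{X_n}(c) = \left( \sum_{t=0}^{\lfloor n/2\rfloor} \frac{T_{k,r}(n,t)}{T_{k,r}(n)}\; e^{ic\,t/\sigma_n} \right) e^{-i\frac{\mu_n}{\sigma_n}c}. \qquad (53)$$

In view of $\varphi_{n,k,r}(s) = \sum_{t\le n/2} T_{k,r}(n,t)e^{ts}$ we
interpret

$$\varphi_{n,k,r}(0) = \sum_{t\le n/2} T_{k,r}(n,t) \quad\text{and}\quad \varphi_{n,k,r}\big((ic)/\sigma_n\big) = \sum_{t\le n/2} T_{k,r}(n,t)\,e^{t(ic)/\sigma_n}.$$

Writing $\varphi_n$ instead of $\varphi_{n,k,r}$, we accordingly obtain

$$f_{X_n}(c) = \frac{1}{\varphi_n(0)}\; \varphi_n\!\left(\frac{ic}{\sigma_n}\right) e^{-i\frac{\mu_n}{\sigma_n}c}. \qquad (54)$$

Now we have to provide the interpretation of $\varphi_n(0)$ and
$\varphi_n((ic)/\sigma_n)$. This is facilitated via Theorem 4.1:

$$[z^n]\,U_k(z,s) = K(s)\,\theta_k(n)\,\big(\gamma_{k,r}(s)^{-1}\big)^{n}\,\big(1 - O(1/n)\big) \quad\text{for some } K(s) \in \mathbb{C}, \qquad (55)$$

uniformly in $s$ and where $\theta_k(n)$ is some subexponential factor,
independent of $s$ (we showed that the singular expansion remains invariant
when substituting $\psi_r(z,s)$ for $z$). Therefore

$$f_{X_n}(c) \sim \frac{K\big(\frac{ic}{\sigma_n}\big)}{K(0)} \left[ \frac{\gamma_{k,r}\big(\frac{ic}{\sigma_n}\big)}{\gamma_{k,r}(0)} \right]^{-n} e^{-i\frac{\mu_n}{\sigma_n}c}, \qquad (56)$$

uniformly in $c$, where $c$ is contained in an arbitrary bounded interval.
Taking the logarithm we obtain

$$\ln f_{X_n}(c) \sim \ln\frac{K\big(\frac{ic}{\sigma_n}\big)}{K(0)} - n \ln\frac{\gamma_{k,r}\big(\frac{ic}{\sigma_n}\big)}{\gamma_{k,r}(0)} - i\frac{\mu_n}{\sigma_n}c. \qquad (57)$$

Expanding $g(s) = \ln\big(\gamma_{k,r}(s)/\gamma_{k,r}(0)\big)$ in its Taylor
series at $s = 0$ (note that $g(0) = 0$ holds) yields

$$\ln\frac{\gamma_{k,r}\big(\frac{ic}{\sigma_n}\big)}{\gamma_{k,r}(0)} = \frac{\gamma'_{k,r}(0)}{\gamma_{k,r}(0)}\,\frac{ic}{\sigma_n} - \left[ \frac{\gamma''_{k,r}(0)}{\gamma_{k,r}(0)} - \left(\frac{\gamma'_{k,r}(0)}{\gamma_{k,r}(0)}\right)^{2} \right] \frac{c^2}{2\sigma_n^2} + O\!\left(\Big(\frac{ic}{\sigma_n}\Big)^{3}\right) \qquad (58)$$

and $\ln f_{X_n}(c)$ becomes asymptotically

$$\ln f_{X_n}(c) \sim \ln\frac{K\big(\frac{ic}{\sigma_n}\big)}{K(0)} - n \left\{ \frac{\gamma'_{k,r}(0)}{\gamma_{k,r}(0)}\,\frac{ic}{\sigma_n} - \left[ \frac{\gamma''_{k,r}(0)}{\gamma_{k,r}(0)} - \left(\frac{\gamma'_{k,r}(0)}{\gamma_{k,r}(0)}\right)^{2} \right] \frac{c^2}{2\sigma_n^2} + O\!\left(\Big(\frac{ic}{\sigma_n}\Big)^{3}\right) \right\} - \frac{i\mu_n c}{\sigma_n}. \qquad (59)$$

$U_k(z,s)$ is analytic in $s$ where $s$ is contained in a disc of radius
$\epsilon$ around 0 and therefore in particular continuous in $s$ for
$|s| < \epsilon$. In view of eq. (59) we set

$$\mu = -\frac{\gamma'_{k,r}(0)}{\gamma_{k,r}(0)}, \qquad \sigma^2 = \left(\frac{\gamma'_{k,r}(0)}{\gamma_{k,r}(0)}\right)^{2} - \frac{\gamma''_{k,r}(0)}{\gamma_{k,r}(0)}.$$

Setting $\mu_n = n\mu$ and $\sigma_n^2 = n\sigma^2$, the terms linear in $c$
in eq. (59) cancel and the quadratic term equals $-c^2/2$. Moreover, we can
conclude from eq. (55) for fixed $c \in\ ]-\infty,\infty[$

$$\lim_{n\to\infty} \big( \ln K((ic)/\sigma_n) - \ln K(0) \big) = 0 \qquad (60)$$

and eq. (59) becomes

$$\ln f_{X_n}(c) \sim -c^2/2 + n\,O\big(((ic)/\sigma_n)^3\big) \qquad (61)$$

with uniform error term for $c$ from any bounded interval; note that
$n\,O(((ic)/\sigma_n)^3) = O(n^{-1/2})$ since $\sigma_n^2 = n\sigma^2$. This is
equivalent to $\lim_{n\to\infty} f_{X_n}(c) = \exp(-c^2/2)$ uniformly in $c$.
The Lévy-Cramér Theorem of Section 2 now implies eq. (50) and the proof of
Theorem 4.2 is complete. □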
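The formulas for $\mu_{k,r}$ and $\sigma^2_{k,r}$ in Theorem 4.2 suggest a simple numerical experiment. The sketch below is one possible approach, not the authors' computation: it assumes $\rho_k = 1/(2(k-1))$ (read off from eq. (4)), finds $\gamma_{k,r}(s)$ for real $s$ as the smallest positive root of eq. (38) by scan plus bisection, and approximates $\gamma'_{k,r}(0)$ and $\gamma''_{k,r}(0)$ by central finite differences with an illustrative step size. Its output is not claimed to reproduce Table 1 digit for digit.

```python
import math

def psi(z, r, s):
    """Left-hand side of eq. (38) plus rho_k, i.e. sqrt(u0) z / v(z, s), for real s."""
    u = math.exp(s)
    u0 = u * z ** (2 * (r - 1)) / (u * z ** (2 * r) - z ** 2 + 1.0)
    v = 1.0 - z + u0 * (z ** 2 + z ** 3 + z ** 4)
    return math.sqrt(u0) * z / v

def gamma(k, r, s=0.0, step=1e-3, iterations=100):
    """Smallest positive root of psi(z, r, s) = rho_k (scan, then bisection)."""
    rho = 1.0 / (2 * (k - 1))  # assumption: radius of convergence implied by eq. (4)
    lo = 0.0
    while lo < 1.0 and psi(lo + step, r, s) < rho:
        lo += step
    if lo >= 1.0:
        raise ValueError("no root bracketed in (0, 1)")
    hi = lo + step
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if psi(mid, r, s) < rho:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def mean_and_variance(k, r, h=1e-3):
    """mu = -g'/g and sigma^2 = (g'/g)^2 - g''/g at s = 0, via central differences."""
    g0 = gamma(k, r, 0.0)
    gp, gm = gamma(k, r, h), gamma(k, r, -h)
    d1 = (gp - gm) / (2 * h)          # approximates gamma'(0)
    d2 = (gp - 2 * g0 + gm) / h ** 2  # approximates gamma''(0)
    mu = -d1 / g0
    sigma2 = (d1 / g0) ** 2 - d2 / g0
    return mu, sigma2

for k in (2, 3):
    print(k, mean_and_variance(k, 3))
```

For a structure of length $n$, the limit law then has mean $n\mu_{k,r}$ and variance $n\sigma^2_{k,r}$.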
         k = 2                   k = 3                   k = 4
         $\mu_{k,r}$  $\sigma^2_{k,r}$   $\mu_{k,r}$  $\sigma^2_{k,r}$   $\mu_{k,r}$  $\sigma^2_{k,r}$
r = 3    0.090323   0.0189975    0.115473   0.0086760    0.123509   0.0076977
r = 4    0.071677   0.0131316    0.086554   0.0055685    0.091737   0.0049917
r = 5    0.059591   0.0098165    0.069467   0.0039688    0.073166   0.0035769
r = 6    0.051092   0.0077233    0.058149   0.0026885    0.060964   0.0027313
r = 7    0.044774   0.0062991    0.050083   0.0017584    0.052319   0.0021788

         k = 5                   k = 6                   k = 7
         $\mu_{k,r}$  $\sigma^2_{k,r}$   $\mu_{k,r}$  $\sigma^2_{k,r}$   $\mu_{k,r}$  $\sigma^2_{k,r}$
r = 3    0.128157   0.0070210    0.131353   0.0065187    0.133748   0.0061254
r = 4    0.094768   0.0020037    0.119551   0.0080515    0.098461   0.0040797
r = 5    0.075345   0.0033114    0.076864   0.0031162    0.078016   0.0029639
r = 6    0.062629   0.0025364    0.063794   0.0023936    0.064680   0.0022823
r = 7    0.053648   0.0020277    0.054580   0.0019171    0.055291   0.0018310

Table 1. The mean $\mu_{k,r}$ and variance $\sigma^2_{k,r}$ of the normal
limit distributions of the random variable $X^n_{k,r}$, for different $k$ and
$r$.
Fig. 1. Central limit distributions for the stack-numbers in 3-noncrossing,
$r$-canonical RNA structures for n = 500: we display the limit distribution of
Theorem 4.2 for r = 3 (black), r = 4 (blue) and r = 5 (red). We also display
the exact enumeration results obtained via Theorem 3.1, eq. (10), represented
by o, • and *, respectively.




Fig. 2. $k$-noncrossing diagrams: we display the only two nonplanar
3-noncrossing diagrams over 10 vertices (upper) and a 3-noncrossing,
3-canonical diagram with arc-length $\lambda \ge 4$ (lower).
Fig. 4. The mapping
$c\colon \mathcal{T}_{k,r}(n,t) \to \bigcup_{tr\le h\le \lfloor n/2\rfloor} \mathcal{C}^*_k(n-2(h-t),t)$
is obtained in two steps: first contraction of the stacks and second
relabeling the resulting diagram.